Smart Paper Notes

Forecasting West Nile Virus with Graph Neural Networks: Harnessing Spatial Dependence in Irregularly Sampled Geospatial Data

Adam Tonks, Trevor Harris, Bo Li, William Brown, Rebecca Smith

Category: Machine Learning

2022-12-21

Machine learning methods have seen increased application to geospatial environmental problems, such as precipitation nowcasting, haze forecasting, and crop yield prediction. However, many of the machine learning methods applied to mosquito population and disease forecasting do not inherently take into account the underlying spatial structure of the given data. In our work, we apply a spatially aware graph neural network model consisting of GraphSAGE layers to forecast the presence of West Nile virus in Illinois, to aid mosquito surveillance and abatement efforts within the state. More generally, we show that graph neural networks applied to irregularly sampled geospatial data can exceed the performance of a range of baseline methods including logistic regression, XGBoost, and fully-connected neural networks.
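As a rough illustration of how a GraphSAGE-style layer uses spatial structure in irregularly sampled data, the sketch below builds a k-nearest-neighbour graph over site coordinates and applies one mean-aggregation layer. The k-nearest-neighbour construction, the function names `knn_graph` and `sage_mean_layer`, and the toy weights are assumptions made for illustration; the abstract does not say how the authors build their graph or configure their layers.

```python
import math

def knn_graph(coords, k):
    """Connect each site to its k nearest other sites (Euclidean distance)."""
    neighbors = []
    for i, p in enumerate(coords):
        others = sorted((j for j in range(len(coords)) if j != i),
                        key=lambda j: math.dist(p, coords[j]))
        neighbors.append(others[:k])
    return neighbors

def sage_mean_layer(features, neighbors, w_self, w_neigh):
    """One GraphSAGE-style layer: ReLU(W_self h_v + W_neigh mean(h_u))."""
    out = []
    for v, h_v in enumerate(features):
        nbrs = neighbors[v]
        dim = len(h_v)
        # Mean of neighbour features (zeros if a node has no neighbours).
        if nbrs:
            mean = [sum(features[u][d] for u in nbrs) / len(nbrs) for d in range(dim)]
        else:
            mean = [0.0] * dim
        row = []
        for ws, wn in zip(w_self, w_neigh):
            z = sum(a * b for a, b in zip(ws, h_v)) + sum(a * b for a, b in zip(wn, mean))
            row.append(max(0.0, z))  # ReLU
        out.append(row)
    return out

# Hypothetical trap sites (x, y) with one feature each.
coords = [(0, 0), (1, 0), (0, 1), (5, 5)]
features = [[1.0], [2.0], [4.0], [0.0]]
graph = knn_graph(coords, k=2)
hidden = sage_mean_layer(features, graph, w_self=[[1.0]], w_neigh=[[1.0]])
```

Each output row mixes a site's own features with the average of its neighbours' features, which is how spatial dependence enters the model; a trained network would learn `w_self` and `w_neigh` rather than fix them.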


3D Neuron Morphology Analysis

Jiaxiang Jiang, Michael Goebel, Cezar Borba, William Smith, B. S. Manjunath

Category: Computer Vision

2022-12-14

We consider the problem of finding an accurate representation of neuron shapes, extracting sub-cellular features, and classifying neurons based on their shapes. In neuroscience research, the skeleton representation is often used as a compact and abstract representation of neuron shapes. However, existing methods are limited to extracting and analyzing "curve" skeletons, which apply only to tubular shapes. This paper presents a 3D neuron morphology analysis method for more general and complex neuron shapes. First, we introduce the concept of a skeleton mesh to represent general neuron shapes and propose a novel method for computing mesh representations from 3D surface point clouds. A skeleton graph is then obtained from the skeleton mesh and is used to extract sub-cellular features. Finally, an unsupervised learning method is used to embed the skeleton graph for neuron classification. Extensive experimental results demonstrate the robustness of our method for analyzing neuron morphology.


Learning Task Requirements and Agent Capabilities for Multi-agent Task Allocation

Bo Fu, William Smith, Denise Rizzo, Matthew Castanier, Maani Ghaffari, Kira Barton

Category: Robotics

2022-11-07

This paper presents a learning framework to estimate an agent capability and task requirement model for multi-agent task allocation. With a set of team configurations and the corresponding task performances as the training data, linear task constraints can be learned and embedded in many existing optimization-based task allocation frameworks. Comprehensive computational evaluations are conducted to test the scalability and prediction accuracy of the learning framework with a limited number of team configuration and performance pairs. A ROS and Gazebo-based simulation environment is developed to validate the proposed requirements learning and task allocation framework in practical multi-agent exploration and manipulation tasks. Results show that the learning process for scenarios with 40 tasks and 6 types of agents takes around 12 seconds and yields prediction errors in the range of 0.5-2%.


BlenderBot 3: a deployed conversational agent that continually learns to responsibly engage

Kurt Shuster, Jing Xu, Mojtaba Komeili, Da Ju, Eric Michael Smith, Stephen Roller, Megan Ung, Moya Chen, Kushal Arora, Joshua Lane

Category: Natural Language Processing | Artificial Intelligence

2022-08-05

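We present BlenderBot 3, a 175B-parameter dialogue model capable of open-domain conversation with access to the internet and a long-term memory, and trained on a large number of user-defined tasks. We release both the model weights and code, and have also deployed the model on a public web page to interact with organic users. This technical report describes how the model was built (architecture, model, and training scheme) and the details of its deployment, including safety mechanisms. Human evaluations show that it outperforms existing open-domain dialogue agents, including its predecessors (Roller et al., 2021; Komeili et al., 2022). Finally, we detail our plan for continual learning using the data collected from deployment, which will also be publicly released. The goal of this research program is thus to enable the community to study ever-improving responsible agents that learn through interaction.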


Neural apparent BRDF fields for multiview photometric stereo

Meghna Asthana, William A. P. Smith, Patrik Huber

Category: Computer Vision

2022-07-14

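We propose to tackle the multiview photometric stereo problem using an extension of Neural Radiance Fields (NeRFs) conditioned on light source direction. The geometric part of our neural representation predicts surface normal direction, allowing us to reason about local surface reflectance. The appearance part of our neural representation is decomposed into a neural bidirectional reflectance function (BRDF), learned as part of the fitting process, and a shadow prediction network (conditioned on light source direction), allowing us to model the apparent BRDF. This balance of learned components with inductive biases based on physical image formation models allows us to extrapolate far from the light source and viewer directions observed during training. We demonstrate our approach on a multiview photometric stereo benchmark and show that competitive performance can be obtained with the neural density representation of a NeRF.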


Rotation-Equivariant Conditional Spherical Neural Fields for Learning a Natural Illumination Prior

James A. D. Gardner, Bernhard Egger, William A. P. Smith

Category: Computer Vision

2022-06-07

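Inverse rendering is an ill-posed problem. Previous work has sought to resolve this by focusing on priors for object or scene shape or appearance. In this work, we instead focus on a prior for natural illumination. Current methods rely on spherical harmonic lighting or other generic representations and, at best, a simplistic prior on the parameters. We propose a conditional neural field representation based on a variational auto-decoder with a SIREN network and, extending Vector Neurons, build rotation equivariance directly into the network. Using this, we develop a rotation-equivariant, high dynamic range (HDR) neural illumination model that is compact and able to express complex, high-frequency features of natural environment maps. Training our model on a curated dataset of 1.6K HDR environment maps of natural scenes, we compare it against traditional representations, demonstrate its applicability to an inverse rendering task, and show environment map completion from partial observations. Our dataset and trained models can be found at jadgardner.github.io/reni.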


GatorTron: A Large Clinical Language Model to Unlock Patient Information from Unstructured Electronic Health Records

Xi Yang, Aokun Chen, Nima PourNejatian, Hoo Chang Shin, Kaleb E Smith, Christopher Parisien, Colin Compas, Cheryl Martin, Mona G Flores, Ying Zhang

Category: Natural Language Processing | Artificial Intelligence | Machine Learning

2022-02-02

There is an increasing interest in developing artificial intelligence (AI) systems to process and interpret electronic health records (EHRs). Natural language processing (NLP) powered by pretrained language models is the key technology for medical AI systems utilizing clinical narratives. However, there are few clinical language models, the largest of which trained in the clinical domain is comparatively small at 110 million parameters (compared with billions of parameters in the general domain). It is not clear how large clinical language models with billions of parameters can help medical AI systems utilize unstructured EHRs. In this study, we develop from scratch a large clinical language model - GatorTron - using >90 billion words of text (including >82 billion words of de-identified clinical text) and systematically evaluate it on 5 clinical NLP tasks including clinical concept extraction, medical relation extraction, semantic textual similarity, natural language inference (NLI), and medical question answering (MQA). We examine how (1) scaling up the number of parameters and (2) scaling up the size of the training data could benefit these NLP tasks. GatorTron models scale up the clinical language model from 110 million to 8.9 billion parameters and improve 5 clinical NLP tasks (e.g., 9.6% and 9.5% improvement in accuracy for NLI and MQA), which can be applied to medical AI systems to improve healthcare delivery. The GatorTron models are publicly available at: https://catalog.ngc.nvidia.com/orgs/nvidia/teams/clara/models/gatortron_og.


Neural Radiance Fields for Outdoor Scene Relighting

Viktor Rudnev, Mohamed Elgharib, William Smith, Lingjie Liu, Vladislav Golyanik, Christian Theobalt

Category: Computer Vision

2021-12-09

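Photorealistic editing of outdoor scenes from photographs requires a profound understanding of the image formation process and an accurate estimation of the scene geometry, reflectance, and illumination. A delicate manipulation of the lighting can then be performed while keeping the scene albedo and geometry unaltered. We present NeRF-OSR, the first approach for outdoor scene relighting based on neural radiance fields. In contrast to the prior art, our technique allows simultaneous editing of both scene illumination and camera viewpoint using only a collection of outdoor photos shot in uncontrolled settings. Moreover, it enables direct control over the scene illumination, as defined through a spherical harmonics model. It also includes a dedicated network for shadow reproduction, which is crucial for high-quality outdoor scene relighting. To evaluate the proposed method, we collect a new benchmark dataset of several outdoor sites, where each site is photographed from multiple viewpoints and at different times. For each time, a 360-degree environment map is captured together with a colour-calibration chequerboard to allow accurate numerical evaluation on real data against ground truth. Comparisons against the state of the art show that NeRF-OSR enables controllable lighting and viewpoint editing at higher quality and with realistic self-shadowing reproduction. Our method and the dataset will be made publicly available at https://4dqv.mpi-inf.mpg.de/nerf-OSR/.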
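To make the idea of lighting "defined through a spherical harmonics model" concrete, the sketch below evaluates a lighting function stored as nine second-order real spherical harmonic coefficients in a given unit direction. It uses the standard real SH basis constants; the coefficient ordering, the function names, and the omission of cosine-lobe convolution factors are simplifying assumptions, and nothing here is taken from the NeRF-OSR implementation.

```python
def sh_basis(direction):
    """Second-order real spherical harmonic basis at a unit direction (x, y, z)."""
    x, y, z = direction
    return [
        0.282095,                        # l=0
        0.488603 * y,                    # l=1, m=-1
        0.488603 * z,                    # l=1, m=0
        0.488603 * x,                    # l=1, m=1
        1.092548 * x * y,                # l=2, m=-2
        1.092548 * y * z,                # l=2, m=-1
        0.315392 * (3.0 * z * z - 1.0),  # l=2, m=0
        1.092548 * x * z,                # l=2, m=1
        0.546274 * (x * x - y * y),      # l=2, m=2
    ]

def sh_lighting(coeffs, direction):
    """Evaluate the SH-encoded lighting function in one direction."""
    return sum(c * b for c, b in zip(coeffs, sh_basis(direction)))

# Hypothetical lighting: a constant ambient term plus light from +z.
coeffs = [1.0, 0.0, 0.5, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0]
facing_up = sh_lighting(coeffs, (0.0, 0.0, 1.0))
facing_down = sh_lighting(coeffs, (0.0, 0.0, -1.0))
```

Changing the nine coefficients changes the illumination while leaving geometry and albedo untouched, which is the kind of direct control the abstract describes.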


Robust Task Scheduling for Heterogeneous Robot Teams under Capability Uncertainty

Bo Fu, William Smith, Denise Rizzo, Matthew Castanier, Maani Ghaffari, Kira Barton

Category: Robotics

2021-06-23

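This paper develops a stochastic programming framework for multi-agent systems in which task decomposition, assignment, and scheduling problems are optimized simultaneously. The framework can be applied to heterogeneous mobile robot teams with distributed sub-tasks. Examples include pandemic robotic service coordination, exploration and rescue, and delivery systems with heterogeneous vehicles. Owing to their inherent flexibility and robustness, multi-agent systems are applied to a growing range of real-world problems that involve heterogeneous tasks and uncertain information. Most previous works assume a single unique way to decompose a task into roles that can later be assigned to agents. This assumption is not valid for complex tasks where the roles can vary and multiple decomposition structures exist. Meanwhile, it is unclear how uncertainties in task requirements and agent capabilities can be systematically quantified and optimized in a multi-agent system setting. A representation for complex tasks is proposed: agent capabilities are represented as a vector of random distributions, and task requirements are verified by a generalizable binary function. The conditional value at risk (CVaR) is chosen as a metric in the objective function to generate robust plans. An efficient algorithm is described to solve the model, and the whole framework is evaluated in two different practical test cases: capture-the-flag and robotic service coordination during a pandemic (e.g., COVID-19). Results demonstrate that the framework is generalizable, scales up to 140 agents and 40 tasks for the example test cases, and provides low-cost plans that ensure a high probability of success.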
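For readers unfamiliar with CVaR, the sketch below shows the usual sample-based estimate: the mean of the worst (1 - alpha) fraction of sampled costs. The function name `cvar`, the default `alpha`, and the sampled plan costs are hypothetical, and the paper embeds CVaR inside an optimization model rather than computing it from a fixed list like this.

```python
import math

def cvar(costs, alpha=0.9):
    """Empirical CVaR: mean of the worst (1 - alpha) fraction of sampled costs."""
    if not 0.0 <= alpha < 1.0:
        raise ValueError("alpha must be in [0, 1)")
    if not costs:
        raise ValueError("costs must be non-empty")
    ordered = sorted(costs, reverse=True)  # worst (largest) costs first
    # Round before ceil to avoid floating-point overshoot such as 3.0000000000000004.
    k = max(1, math.ceil(round((1.0 - alpha) * len(ordered), 9)))
    return sum(ordered[:k]) / k

# Hypothetical sampled costs for two candidate plans with the same mean cost.
steady_plan = [5.0, 5.0, 5.0, 5.0, 5.0, 5.0, 5.0, 5.0, 5.0, 5.0]
risky_plan = [1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 41.0]
steady_risk = cvar(steady_plan, alpha=0.9)
risky_risk = cvar(risky_plan, alpha=0.9)
```

Both plans have the same average cost, but the CVaR of `risky_plan` is far higher because its tail contains one very bad outcome. Penalizing CVaR in the objective therefore steers the optimizer toward plans that remain acceptable under uncertainty.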


Competency Problems: On Finding and Removing Artifacts in Language Data

Matt Gardner, William Merrill, Jesse Dodge, Matthew E. Peters, Alexis Ross, Sameer Singh, Noah A. Smith

Category: Natural Language Processing

2021-04-17

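Much recent work in NLP has documented dataset artifacts, bias, and spurious correlations between input features and output labels. However, how to tell which features have "spurious" rather than legitimate correlations is typically left unspecified. In this work, we argue that for complex language understanding tasks, all simple feature correlations are spurious, and we formalize this notion into a class of problems that we call competency problems. For example, the word "amazing" on its own should not give information about a sentiment label independent of the context in which it appears, which could include negation, metaphor, sarcasm, etc. We theoretically analyze the difficulty of creating data for competency problems when human bias is taken into account, showing that realistic datasets will increasingly deviate from competency problems as dataset size increases. This analysis gives us a simple statistical test for dataset artifacts, which we use to show more subtle biases than were described in prior work, including demonstrating that models are inappropriately affected by these less extreme biases. Our theoretical treatment of this problem also allows us to analyze proposed solutions, such as making local edits to dataset instances, and to give recommendations for future data collection and model design efforts that target competency problems.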
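The abstract does not spell out the form of its statistical test, so the sketch below is only one plausible reading: a one-proportion z-test of whether a single feature (such as a word) co-occurs with a label more often than a uniform label distribution would predict. The function name `artifact_z`, the uniform null hypothesis, and the counts are assumptions made for illustration.

```python
import math

def artifact_z(feature_count, label_count, num_labels):
    """z-statistic for H0: p(label | feature) = 1 / num_labels.

    feature_count: number of instances containing the feature
    label_count:   how many of those instances carry the label of interest
    """
    if feature_count <= 0:
        raise ValueError("feature_count must be positive")
    p0 = 1.0 / num_labels
    p_hat = label_count / feature_count
    std_err = math.sqrt(p0 * (1.0 - p0) / feature_count)
    return (p_hat - p0) / std_err

# Hypothetical counts: "amazing" appears in 100 reviews, 70 of them positive,
# in a two-class sentiment dataset.
z_biased = artifact_z(feature_count=100, label_count=70, num_labels=2)
z_balanced = artifact_z(feature_count=100, label_count=50, num_labels=2)
```

A large absolute z-value flags the feature as a candidate artifact under this assumed null. When many features are tested at once, a multiple-comparison correction would be needed before drawing conclusions.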
